Search CORE

35 research outputs found

GEML: A Grammatical Evolution, Machine Learning Approach to Multi-class Classification

Author: A Kattan
A Kattan
A Mojsilović
C Downey
C Ji
DB Fogel
ER Hruschka
F Pedregosa
H Pan
H Steinhaus
K Neshatian
L Breiman
L Muñoz
M Castelli
M Keijzer
M Zhang
MC Cowgill
NS Altman
RC Barros
RE Schapire
RMA Azad
S Belhassen
S Deodhar
TG Dietterich
U Bhowan
U Maulik
UN Raghavan
W Smart
Y Ren
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016
Field of study

In this paper, we propose a hybrid approach to solving multi-class problems which combines evolutionary computation with elements of traditional machine learning. The method, Grammatical Evolution Machine Learning (GEML) adapts machine learning concepts from decision tree learning and clustering methods and integrates these into a Grammatical Evolution framework. We investigate the effectiveness of GEML on several supervised, semi-supervised and unsupervised multi-class problems and demonstrate its competitive performance when compared with several well known machine learning algorithms. The GEML framework evolves human readable solutions which provide an explanation of the logic behind its classification decisions, offering a significant advantage over existing paradigms for unsupervised and semi-supervised learning. In addition we also examine the possibility of improving the performance of the algorithm through the application of several ensemble techniques

Crossref

Birmingham City University Open Access Repository

BCU Open Access

Factors that influence the characteristics of needles and syringes used by people who inject drugs in Tajikistan

Author: A Latypov
Alisher Latypov
B Miles Matthew
David Otiashvili
DJ Hruschka
E Paintsil
ER Pouget
Georgiy V. Bobashev
GV Bobashev
HR Bernard
Irma Kirtadze
JL Fleiss
JL Syvertsen
JP Grund
JR Landis
MD Gaughwin
N Abdala
N Ishak
N Walsh
N Walsh
P Higgs
P Vickerman
R Senbanjo
S Koester
SL Bailey
U Ibragimov
Umedjon Ibragimov
W Zule
WA Zule
WA Zule
William A. Zule
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Natural variation in expression of genes involved in xylem development in loblolly pine (Pinus taeda L.)

Author: A Hartemink
A Patzlaff
A Patzlaff
A Stahlberg
Andrew J. Eckert
AV LeBude
B Boyle
Barry Goldfarb
C Alonso-Blanco
C Alonso-Blanco
C Bomal
C Plomion
CA Loopstra
Candace M. Seeve
Carol A. Loopstra
CJ Needham
D Falush
DB Rowe
DL Auger
DR Gang
ER Hruschka
F Liu
F Nicol
G Schindelman
I Allona
I Nachman
J Yu
JD Storey
JFD Dean
JH Ward
JJB Keurentjes
JK Pritchard
JP Townsend
K Basso
LM Steinmetz
M Bansal
M Vuylsteke
MF Oleksiak
N Friedman
P Kaothien
PCH Ma
PN Benfey
R Levesque
R Murthy
R Suzuki
R Zhong
R Zhong
R Zhong
R Zhong
RB Turley
RR Sederoff
RW Whetten
S Chang
S Persson
S-H Yang
S-H Yang
S-H Yang
SF Altschul
Sreenath Reddy Palle
T Mitchell-Olds
VG Cheung
W Bao
W. Patrick Cumbie
Y Zhang
YB Linhart
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Evolving clusters in gene-expression data

Author: Campello RJGB
de Castro LN
Hruschka ER
Publication venue: 'Elsevier BV'
Publication date: 26/11/2015
Field of study

Clustering is a useful. exploratory tool for gene-expression data. Although successful applications of clustering techniques have been reported in the literature, there is no method of choice in the gene-expression analysis community. Moreover, there are only a few works that deal with the problem of automatically estimating the number of clusters in bioinformatics datasets. Most clustering methods require the number k of clusters to be either specified in advance or selected a posteriori from a set of clustering solutions over a range of k. In both cases, the user has to select the number of clusters. This paper proposes improvements to a clustering genetic algorithm that is capable of automatically discovering an optimal number of clusters and its corresponding optimal partition based upon numeric criteria. The proposed improvements are mainly designed to enhance the efficiency of the original clustering genetic algorithm, resulting in two new clustering genetic algorithms and an evolutionary algorithm for clustering (EAC). The original clustering genetic algorithm and its modified versions are evaluated in several runs using six gene-expression datasets in which the right clusters are known a priori. The results illustrate that all the proposed algorithms perform well in gene-expression data, although statistical comparisons in terms of the computational efficiency of each algorithm point out that EAC outperforms the others. Statistical evidence also shows that EAC is able to outperform a traditional method based on multiple runs of k-means over a range of k. (C) 2005 Elsevier Inc. All rights reserved.176131898192

Repositorio da Producao Cientifica e Intelectual da Unicamp

The construction of causal networks to estimate coral bleaching intensity

Author: Gherardi DFM
Hruschka ER
Kikuchi RKP
Krug LA
Leão ZMAN
Stech JL
Suggett DJ
Publication venue: 'Elsevier BV'
Publication date: 01/04/2013
Field of study

Current metrics for predicting bleaching episodes, e.g. NOAA's Coral Reef Watch Program, do not seem to apply well to Brazil's marginal reefs located in Bahia state and alternative predictive approaches must be sought for effective long term management. Bleaching occurrences at Abrolhos have been observed since the 1990s but with a much lower frequency/extent than for other reef systems worldwide. We constructed a Bayesian Belief Network (BN) to back-predict the intensity of bleaching events and learn how local and regional scale forcing factors interact to enhance or alleviate coral bleaching specific to Abrolhos. Bleaching intensity data were collected for several reef sites across Bahia state coast (~12°-20°S; 37°-40°W) during the austral summer 1994-2005 and compared to environmental data: sea surface temperature (SST), diffuse light attenuation coefficient at 490 nm (K490), rain precipitation, wind velocities, and El Niño Southern Oscillation (ENSO) proxies. Conditional independence tests were calculated to produce four specialized BNs, each with specific factors that likely regulate bleaching intensity. All specialized BNs identified that a five-day accumulated SST proxy (SSTAc5d) was the exclusive parent node for coral bleaching producing a total predictive rate of 88% based on SSTAc5d state. When SSTAc5d was simulated as unknown, the Thermal-Eolic Resultant BN kept the total predictive rate of 88%. Our approach has produced initial means to predict beaching intensity at Abrolhos. However, the robustness of the model required for management purposes must be further (and regularly) operationally tested with new in situ and remote sensing data. © 2013 Elsevier Ltd

OPUS - University of Technology Sydney

Modified genetic algorithm-based clustering for probability density functions

Author: Cinlar E
Falkenauer E
Goh A
Hruschka ER
Martinez WL
Safe M
Tai Vo-Van
Thao Nguyen-Trang
Trung Nguyen-Thoi
Trung Vo-Duy
Vinh Ho-Huu
Publication venue: 'Informa UK Limited'
Publication date
Field of study

Crossref

Fuzzy C-Means Cluster Analysis Based on Variable Length String Genetic Algorithm for the Grouping of Rock Discontinuity Sets

Author: AK Jain
CW Duncan
D Marcotte
E-chuan Yan
ER Hruschka
ER Hruschka
J Zheng
JC Bezdek
K Krishna
LM Xu
PHSW Kulatilake
PHSW Kulatilake
Q Lei
R Jimenez
R Jimenez-Rodriguez
R Srikanth
RE Hammah
RE Hammah
RJ Shanley
S Bandyopadhyay
S Salimzadeh
S Song
U Maulik
U Maulik
W Zhou
XL Xie
Xuejie Cui
Y Li
YC Chiou
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Introducing interactive evolutionary computation in data clustering

Author: A Ben-Israel
A Brintrup
B Everitt
CJ Goodwin
ER Hruschka
G Menardi
H Takagi
J Kihlstrom
KL Du
MJ Abul Hasan
O Miglino
PJ Rousseeuw
R Xu
S Bandyopadhyay
T Lumley
Y Yang
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2014
Field of study

Data clustering consists in finding homogeneous groups in a dataset. The importance attributed to cluster analysis is related to its fundamental role in many knowledge fields. Often data clustering techniques are the ghost host of many innovative applications for a wide range of problems (i.e. biology, marketing, customers segmentation, intelligent machines, machine translation, etc.). Recently, there is an emerging interest in Data Clustering community to develop bio-inspired algorithms in order to find new methods for clustering. It is widely observed that bio-inspired algorithms and the Evolutionary Computation (EC) techniques reach solutions similar to others computational approaches but using a bigger computational power. This limitation represents a concrete obstacle to an extensive use of Evolutionary (or bio-inspired) approach to data clustering applications. In the present paper we propose to use Interactive Evolutionary Computation (IEC) techniques where a human being (the breeder) selects Cluster configurations (genotypes) on the basis of their graphical visualizations (phenotypes). We describe a first version of a software, called Revok, that implements the IEC basic principles applied to data clustering. In the conclusion section we outline the necessary steps to reach a mature IEC tool for data clustering

Archivio della ricerca - Università degli studi di Napoli Federico II

Crossref

Adaptive crossover memetic differential harmony search for optimizing document clustering

Author: AK Uysal
D Karaboga
ER Hruschka
F Neri
H Abedinpourshotorban
I Al-Jadir
J Zhang
JE Smith
KK Bharti
KK Bharti
M-T Vakil-Baghmisheh
P Chakraborty
P Lučić
QH Nguyen
R Forsati
R Forsati
S Das
XZ Gao
Publication venue: Springer Verlag
Publication date: 01/01/2018
Field of study

An Adaptive Crossover Memetic Differential Harmony Search (ACMDHS) method was developed for optimizing document clustering in this paper. Due to the complexity of the documents available today, the allocation of the centroid of the document clusters and finding the optimum clusters in the search space are more complex to deal with. One of the possible enhancements on the document clustering is the use of Harmony Search (HS) algorithm to optimize the search. As HS is highly dependent on its control parameters, a differential version of HS was introduced. In the modified version of HS, the Band Width parameter (BW) has been replaced by another pitch adjustment technique due to the sensitivity of the BW parameter. Thus, the Differential Evolution (DE) mutation was used instead. In this paper the DE crossover was also used with the Differential HS for further search space exploitation, the produced global search is named Crossover DHS (CDHS). Moreover, DE crossover (Cr) and mutation (F) probabilities are dynamically tuned through generations. The Memetic optimization was used to enhance the local search capability of CDHS. The proposed ACMDHS was compared to other document clustering techniques using HS, DHS, and K-means methods. It was also compared to its other two variants which are the Memetic DHS (MDHS) and the Crossover Memetic Differential Harmony Search (CMDHS). Moreover, two state-of-the-art clustering methods were also considered in comparisons, the Chaotic Gradient Artificial Bee Colony (CGABC) and the Differential Evolution Memetic Clustering (DEMC). From the experimental results, it was shown that CMDHS variant (the non-adaptive version of ACMDHS) and ACMDHS were highly competitive while both CMDHS and ACMDHS were superior to all other methods

Crossref

Research Repository

A transparent rule-based expert system using neural network

Author: A Bondarenko
A Gupta
A Karim
AK Mann
AK Sharma
CJ Mantas
ER Hruschka
G Bologna
HH Dam
J Han
K Jivani
K Kaikhah
K Odajimaa
M Craven
M Mashayekhi
M Shridhar
MG Augasta
P Kaviani
R Setiono
R Setiono
R Setiono
S Cohen
SK Biswas
T Botari
V Sing
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref